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The CMOS 6502 

A new version of the 6502 microprocessor does more than save 
power—it includes powerful new instructions 



Rockwell has introduced a CMOS 
(complementary metal-oxide semi- 
conductor) version of the 6502 micro- 
processor that fills a number of gaps 
in the standard 6502's instruction set 
while offering the low power-con- 
sumption advantages of CMOS tech- 
nology. Pin and software compatible 
with the standard 6502 chip, the 
CMOS version (designated the 
R65C02) promises to extend the range 
of applications that 6502-based pack- 
ages can serve. 

A mainstay of the personal-com- 
puter industry since the first Apple 
computer was produced, the stan- 
dard 6502 microprocessor has a sim- 
ple, straightforward instruction set 
and simple interfacing requirements. 
The instruction set at first appears to 
be restricted in comparison to other 
8-bit processors such as the Z80, but, 
in practice, the simplicity of the in- 
struction set often yields a shorter, 
faster program for common micro- 
processor applications. The instruc- 
tion set does have restrictions on the 
use of certain addressing modes with 
some instructions and has several 
minor anomalies that are poorly 
documented. 



by Steven Hendrix 

In this article I will discuss some of 
the 6502's lesser-known deficiencies 
and the changes in the CMOS ver- 
sion that correct some of these prob- 
lems. I will also review the CMOS 
version's instructions and added ad- 



Several 6502 

instructions don't 

beliave as you migfit 

expect tliem to. 

dressing modes, and finally I will 
describe some hardware interfacing 
considerations. 

Quirks of the 6502 

Several instructions on the 6502 do 
not behave as the documentation 
would have you believe. These irreg- 
ularities rarely affect programs, 
which makes them more difficult to 
debug when they do enter into a pro- 
gram. The quirks discussed here per- 
tain to the return-from-interrupt in- 
struction, the branch-instruction tim- 
ing, the absolute indirect-addressing 
mode, and bus cycles on certain 
index-addressing modes. The CMOS 



version's design has not altered the 
return-from-interrupt and branch- 
instruction timing; therefore, the in- 
formation presented on these topics 
pertains to both the standard and 
CMOS versions of the 6502. The 
CMOS version's design, however, 
has corrected the absolute indirect- 
addressing mode and bus-cycle 
anomalies. 

RTI versus RTS 

The RTI (return-from-interrupt) in- 
struction appears functionally equiv- 
alent to the sequence PIP (puU status 
register from stack), RTS (return from 
subroutine). An interrupt is acknowl- 
edged at the end of an instruction, at 
which time the processor pushes the 
contents of the program counter on 
the stack, high byte followed by low 
byte, and then pushes the processor- 
status byte on the stack before jump- 
ing through the interrupt vector to 
the interrupt-handling routine. 

The difference between the RTI in- 
struction and the PLP, RTS sequence 
lies in the sequence in which the pro- 
gram counter is incremented. During 
a JSR (jump to subroutine), the value 
pushed on the stack is the address of 
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the third byte of the JSR instruction. 
Thus, the program counter is re- 
loaded during an RTS instruction and 
then incremented before the attempt 
to fetch the next instruction. An in- 
terrupt pushes the address of the first 
byte of the next instruction to be ex- 
ecuted, so the RTI instruction reloads 
the program counter and fetches the 
next instruction without first in- 
crementing the program counter. 
This difference becomes especially 
important in writing software for 
tracing or single-stepping functions. 

Branch-Instruction Timing 

The branch-instruction timing 
problem lies not with the 6502, but 
rather with its documentation. The 
original data sheets specify the tim- 
ing correctly, but several independent 
tutorials have incorrectly stated how 
long a branch instruction takes. 

Unlike most other 6502 instruc- 
tions, a branch instruction requires a 
variable number of clock cycles— 
from two to four, depending on the 
circumstances surrounding the 



branch. 

During the first clock cycle (bus cy- 
cle), the processor fetches the branch 
op code. The second cycle fetches the 
second byte of the instruction, which 
is the offset to be used if the branch 
is taken. 

Several independent 
tutorials have confused 
6502 branch- 
instruction timing 
considerations. 

If the branch condition (flag set or 
cleared) is not met, the fetch for the 
next instruction occurs during the 
next clock cycle. If the branch is 
taken, the next cycle is used to add 
the offset to the low-order byte of the 
program counter. If there is a carry or 
borrow from this operation (consider- 
ing the offset to be a signed value), 
a fourth clock cycle is used to update 
the high-order byte of the program 
counter. 
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The net result is that a branch that 
is not taken requires two clock cycles. 
A branch to a location within the 
same page requires three clock cycles, 
and only in the case of a branch that 
crosses a page boundary does the in- 
struction require the full four cycles. 
Typical timing loops, especially for 
intervals under a millisecond or so, 
require close attention to these details 
of the branch-instruction timing. 

Absolute Indirect 
Mode Wraparound 

The absolute indirect-addressing 
mode works only with the JMP 
(jump) instruction. In normal use, it 
is a 3-byte instruction: the first byte 
contains the o°p code (6C)(all instruc- 
tions and addresses are specified in 
hexadecimal); the second byte con- 
tains the low-order part of a memory 
address; and the third byte contains 
the high-order part of that address. 
The processor loads the byte at the 
referenced address into the low half 
of the program counter, and it loads 
the byte in the next higher memory 
location into the high half of the pro- 
gram counter. Thus, the instruction's 
effect is to jump to the location 
specified by the two bytes stored at 
the address given in the instruction. 
A problem arises, however, when 
the jump destination is stored with 
the two bytes split between two 
memory pages (that is, if the second 
byte of the instruction is FF). The pro- 
cessor loads the referenced byte into 
the low half of the program counter 
and attempts to increment the ad- 
dress given in the instruction to load 
the high byte. However, it disregards 
the carry from the increment opera- 
tion on the low byte of the address, 
with the result that the high byte of 
the program counter is loaded from 
the memory location 255 bytes prior 
to the referenced location. 

In table 1 the JMP instructions il- 
lustrate this problem. The left-hand- 
column code operates correctly, load- 
ing the value A345 into the program 
counter. The right-hand-column 
code, however, does not correctly 
load the value A345 into the pro- 
gram. It does load the value 45, 
stored at location 02FF, into the pro- 
gram counter's low-order byte, but 
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A345, JMP (0200) 

0200 45 

0201 A3 

Result: A345 - PC 



A345 JMP (02FF) 

0200 59 

02FF 45 

0300 A3 

Result: 5945 - PC 



Table 1: Two sets of memory contents il- 
lustrating operation of the 6502 fMP in- 
struction. The left-hand column of code 
operates as expected, but due to an instruc- 
tion-set anomaly, the right-hand column's 
code yields an unexpected result because 
the program counter's desired high-order 
byte resides in a different page of memory 
than does the low-order byte. 



rather than transferring to the next 
page of memory to obtain the high- 
order program-counter byte from 
location 0300, it incorrectly loads the 
value stored at location 0200 (59 in 
this case) into the program counter's 
high-order byte. 

This anomaly can cause major 
problems when you attempt to devel- 
op general-purpose table-driven soft- 
ware. If the appUcation program does 
not contain special code to insure that 
an indirect jump never references an 
address at the end of a page, unpre- 
dictable behavior that is difficult to 
trace can result. The R65C02 report- 
edly handles the absolute indirect- 
addressing mode correctly for all 
cases. 



Spurious Bus-Read Cycles 

A rare problem with I/O (input/ 
output) devices can occur because of 
the nature of the 6502 bus. Two spe- 
cific factors combine to cause this 
problem: all I/O is memory-mapped, 
and there is no such thing as an in- 
active bus cycle. In some cases, in- 
dexed instructions can lead to inad- 
vertent accesses to I/O devices be- 
cause of these two facts. 

The 6502 treats memory and I/O 
ports alike, viewing both as memory. 
As a result, a system's decoding hard- 
ware causes I/O ports to appear at 
specific locations that look like part 
of the memory-address space to the 
6502. A "read" bus cycle addressing 
a port acts as an "input" operation, 
and a "write" cycle acts as an 



<>i 




'^ 



446 December 1983 © BYTE PubUcations Inc. 



"output" operation. 

The 6502 does not have separate 
pins for a "read" and a "write" signal, 
as do other processors such as the 
8080 or the Z80. Instead, the R/W 
(read/write) signal is used to 
designate a "read" cycle if it is in a 
high state or a "write" cycle if it is in 
a low state. Timing is coordinated by 
the Phase 2 clock. If the read/write 
line is high when the Phase 2 clock 
is high, the device whose address ap- 
pears on the address bus places data 
on the data bus. If the read/write line 
is low while the Phase 2 clock is high, 
the addressed device accepts data 
from the bus. 

To show how indexed instructions 
can interfere with I/O devices, let's 
examine the bus cycles carried out to 
load the accumulator from an abso- 
lute address indexed by the X regis- 
ter. In standard 6502 mnemonics, this 
load instruction is LDA ADDR,X. 
This instruction takes four cycles un- 
less the indexing crosses page 
boundaries, in which case it takes 
five. The latter is the troublemaker. 
During the first cycle, the 6502 
fetches the op code. The second and 
third cycles are used to fetch the low 
and high bytes of ADDR, respective- 
ly. If the indexing operation does not 
cross a page boundary, the sum of 
ADDR and X is placed on the ad- 
dress bus during the next cycle, and 
the A register is loaded from the data 
bus, finishing the instruction. If a 
page boundary is crossed, however, 
a partially formed address is placed 
on the bus during cycle four and the 
actual load happens in a fifth cycle. 
R)r normal memory access, the fifth 
cycle does no harm because it is a 
read cycle, resulting in memory plac- 
ing data on the bus but no registers 
or memory being changed by it. 
(Even if the instruction is a store in- 
struction, the cycle involving this par- 
tially formed address is a read cycle.) 
Certain I/O devices, however, are 
affected by read operations. Ibr in- 
stance, a spurious read from a 6850 
ACIA (Asychronous Communica- 
tions Interface Adapter) could reset 
the "receive data register full" flag, so 
that a later operation would find that 
data was not available. Various other 
I/O devices such as parallel ports and 
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counter/timers can also be affected by 
spurious reads. If the indexed ad- 
dress crosses a page boundary from 
the page in which the I/O device re- 
sides, the partially formed address 
placed on the bus during the fourth 
bus cycle can trip the I/O device. The 
R65C02 reportedly corrects this prob- 
lem. 

New Instructions 

The R65C02 includes a number of 
new instructions, making it more 
powerful than the 6502. (The text box 
"An Assembler for the R65C02" on 
page 452 describes an assembler that 
supports the R65C02's extended in- 
struction set.) Conditional branching 
based on the state of any bit in page 
0, an unconditional short relative 
branch, stack operations for the X 
and Y registers, the ability to set or 
clear any individual bit in page 0, 
zeroing any byte in memory, and a 
"test and reset" or "test and set" 
memory bit instruction have been 
added. 
The BBRx (branch on bit reset) in- 



structions permit any bit in page to 
be used as a flag. These are 3-byte in- 
structions, with the op code in the 
first byte, the page-0 address of the 
byte containing the flag in the second 
byte, and the relative jump displace- 
ment in the third byte. Bits 6 through 



The R65C02 includes a 
number of new 

instructions, maicing it 
more powerful than 
the standard 6502. 

4 of the op code give (in binary) the 
number of the bit within the page-0 
byte to be tested. The processor reads 
the byte from page 0, checks the bit 
designated by the op code, and con- 
tinues normal program flow if the 
designated bit is a 1. If it is a 0, a nor- 
mal signed relative short branch is ex- 
ecuted, using the third byte of the in- 
struction for the offset. The BBSx 
(branch on bit set) instructions do the 




same thing except that they take4li& 
branch only if the referenced bit is set 
to 1. 

Unconditional Short Branch 

The unconditional short-branch in- 
struction (BRA) eases writing of po- 
sition-independent code and in some 
cases allows shorter code. With the 
6502, a sequence such as SEC (set 
carry), BCS (branch if carry set) is 
sometimes necessary to cause an un- 
conditional position-independent 
jump. Even that sequence requires 3 
bytes, as does a normal absolute 
jump (JMP). The BRA instruction 
permits an unconditional, posi- 
tion-independent branch requiring 
only 2 bytes. 

Four new stack-manipulation in- 
structions have been added to act on 
the X and Y registers. In 6502 pro- 
grams, the X and Y registers could be 
pushed only by transferring them 
first to the A register. Thus, the nor- 
mal sequence for saving the registers 
for an interrupt routine went some- 
thing like this: PHA (push the A 
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register on the stack), TXA (transfer 
X to A), PHA, TY\ (transfer Y to A), 
PHA. This sequence required extra 
time and memory and also made it 
difficult for a routine to save and 
restore all the registers and make use 
of a value passed to it in the A 
register. The four new instructions 
permit direct pushing and pulling of 
both the X and Y registers. 

Set and Clear Page-0 Bits 

Companions to the BBRx and BBSx 
instructions, the RMBx (reset-mem- 
ory bit) and SMBx (set-memory bit) 
instructions permit setting and clear- 
ing single-bit flags in page without 
affecting any internal processor reg- 
isters accessible to the programmer. 
As before, bits 6 through 4 of the op 
code specify which bit is affected, 
and the second byte of the instruc- 
tion specifies the page-0 location 
affected. 

The new STZ (store zero) instruc- 
tion permits zeroing an entire byte 
anywhere in memory without affect- 
ing processor registers. Four available 
addressing modes allow a 2-byte 
form for page-0 operations and a 
3-byte form for general addresses, 
either of which may be indexed by 
the X register. 

The TRB (test and reset bits) in- 
struction is a composite of the 6502 
BIT (bit test) and AND (logical and) 
instructions. The N (negative) flag is 
set to the value of bit 7 of the refer- 
enced memory location, and the V 
(overflow) flag is set to the value of 
bit 6. A logical AND is then per- 
formed between the referenced 
memory location and the A register, 
with the result stored into the mem- 
ory location (A is unaffected), and 
the Z (zero) flag is changed to in- 
dicate the result of this operation (set 
if the result is 0, reset if it is nonzero). 
Note that, just as on the 6502, the N 
and V flags pertain to the value in 
memory before the AND operation 
takes place. The TSB (test and set 
bits) instruction is similar except that 
a logical OR is substituted for the 
logical AND operation. 



ing addressing modes to be used 
with instructions that did not accept 
those modes on the original 6502. It 
also adds an entirely new addressing 
mode usable with a number of pres- 
ent instructions that should prove 
useful in making better use of the 
processor registers. 

The 6502 has no simple indirect- 
addressing mode other than the JMP 
instruction. With no 16-bit registers to 
hold addresses, 6502 programs fre- 
quently keep addresses in page 0, es- 
pecially when passing addresses to 
and from subroutines. However, the 
only way to use those addresses to 



tions as a logical AND except that the 
result is discarded, it is normally 
used to test flags. Most such tests 
would be most conveniently done 
with an immediate addressing mode, 
which is not permitted. Instead, 6502 
programs must use a backward form 
of logic, loading the test mask using 
the immediate mode and then doing 
the test on the data directly from 
memory. 

The R65C02 BIT instruction permits 
additional addressing modes— im- 
mediate, 0-page indexed, and abso- 
lute indexed. These added modes 
cover the vast majority of the situa- 
tions in which this instruction would 
be used. 



The R65C02 includes a 

simple Indirect- 
addressing mode using 
a 2-byte address. 



access the data to which they point 
is through the pre- or post-indexed 
indirect-addressing modes. Thus, a 
common sequence in programs con- 
sists of loading the Y register with 0, 
followed by an operation using the 
"indirect, indexed by Y" addressing 
mode. Not only does this sequence 
result in extra code requiring addi- 
tional memory space and execution 
time, but it ties up the Y register, 
which might be better used in other 
ways. 

The R65C02 corrects this deficien- 
cy by adding a simple indirect-ad- 
dressing mode, which uses a 2-byte 
address stored in page 0. This ad- 
dressing mode can be used with all 
the major accumulator instructions: 
ADC (add with carry), AND (logical 
and), CMP (compare memory with 
accumulator), EOR (logical exclusive- 
or), LDA (load accumulator from 
memory), ORA (logical inclusive-or), 
SBC (subtract with borrow), and STA 
(store accumulator to memory). 



Addressing Modes 

In addition to totally new instruc- 
tions, the R65C02 enables some exist- 



New Modes for BIT 

The BIT (bit test) instruction of the 
6502 is severely limited in addressing 
modes. This instruction accepts only 
two modes: absolute (direct) and 
page. Because this instruction func- 



Increment and 
Decrement Accumulator 

Arithmetic on the X and Y registers 
is not permitted by the 6502; neither 
is incrementing or decrementing the 
accumulator. Though such a need is 
rare, it does arise, and the lack of an 
accimiulator-addressing mode for the 
increment and decrement instruction 
results in various kludges to get the 
desired result. Three alternate ways 
are commonly used. The most ob- 
vious is to use the ADC (add with 
carry) instruction to add an im- 
mediate value of 1. Because the 6502 
does not provide a simple "add" in- 
struction (without carry), this alter- 
nate method also requires a preced- 
ing CLC (clear carry) instruction, un- 
less the state of the carry bit from 
prior operations is known. Alterna- 
tively, setting the carry bit followed 
by adding an immediate value of 
accomplishes the same thing. 

If the X or Y registers are not in use 
at the particular point in the pro- 
gram, it is possible to transfer the 
value from the A register to one of 
those registers and take advantage of 
the increment or decrement instruc- 
tions for X and Y. A third method, 
most commonly used when the next 
step is to store the accumulator value 
in memory, is to store the A register 
value first and then increment it in 
memory, because the INC (incre- 
ment) and DEC (decrement) instruc- 
tions accept several different address- 
ing modes for operations directly on 
data in memory. 



December 1983 © BYTE Publications Inc. 451 



Circle 488 on inquiry card. 



^*s«^ '^-iiacs' 



SAVE AT ELEK-TEK 
OIM PRINTERS 



HUGE SAVINGS ON 
ALL EPSON PRINTERS 

MX 80 FT MX 100 
FX80 FX100 



CALL FOR SUPER 
LOW PRICES 



,111111 



> 



EPSON RX-80 
275.00 

8750 Ribbon Cartridges for Epson 

80 Column Printers 4.00 

8755 Ribbon Cartridges for Epson 

132 Column Printers 7,00 




'\S^%^^ / GEMINI iOX 
^^ W^ / DOT MATRIX PRINTER 
I /. — 

GEMINI 10X 
$275.00 

GEMINI 15 
15 in. wide carriage 

$399.00 

NEW HIGH SPEED 

DELTA 10 — call for price 

Gem 01 Ribbons for Gemini Printers - 6 for 15.00 
12 for 24.00 

Cables for Epson or Gemini 

PA10A 10 ft 36/38 pin 

standard parallel 30.00 

IB-P10 10 ft. 36/25 pin parallel 

for IBM 32.00 

PA6T 6 ft. 36/16 pin parallel 

for Tf-99/4A 25.00 

RS10A 10 ft. 25 pin 

standard RS-232[full loaded] 21.00 

RS1Y RS-232 Y cable for TI-99/4A . . 35.00 



PRINTER INTERFACES 
OISCOUNTED TOO! 



LETTER QUALITY PRINTERS 

S500— S1,550 
TTX — COMREX — DIABLO 



CALLTOLLFREE 8dd-621-12iB9 

EXCEPT Illinois, Araska, Hawaii 



Corp. Accts. invitad. Min Ord, 515.00 Mastercard or Visa by 
mail or phone Mail Casher's Check. Money Ord.. Fters. Check [2 wks 
to cir J Add S4 00 1st item. (AK, HI, P.R., Canada add S10.00 
first itamJSI. GO ea add'Ishpg & handl. Shipments to iL address 
add 6% tax. Prices subi to change WHITE for fraa catalog. 
Raturn policy for dafsctiuH on arrival rBplacamanta only; BO 
day mfr. wty. ALL ELEK-TEK MERCHANDISE IS BRAND 
NEW, FIRST QUALITY AND COMPLETE. 




The R65C02 eliminates all of this 
foolishness by allowing the accumu- 
lator-addressing mode to be used 
with the increment and decrement 
instructions, enabling them to oper- 
ate on all three of the general-pur- 
pose registers. 

Hardware Factors 

The R65C02 has the electrical char- 
acteristics you would expect from the 
current generation of CMOS inte- 
grated circuits. Versions for speeds to 
6 MHz will probably be available. 
Power consumption is low and varies 
with speed, as is normal for CMOS 
technology. With the clock stopped, 
10 iiW power consumption is listed as 
maximum. Maximum power con- 
sumption in normal operation is 
listed as 4 mA (20 mW) per MHz, 
making battery-powered operation 
feasible when this chip is combined 
with the new CMOS memory chips. 

Rockwell claims that the basic 
R65C02 version is pin and software 
compatible with the 6502. Another 
version, the R65C102, can generate all 
clock signals on-chip; it needs only 



quiring a two-phase clock input that 
would be generated by the system 
master processor. 

Summary 

The CMOS version of the 6502 chip 
fills in a number of gaps in the 6502 
instruction set in addition to adding 
the obvious advantages of CMOS 
technology. The characteristics of the 
new chip permit the 6502 to expand 
in both directions into areas that were 
previously impractical. Completely 
battery-powered systems are now 
feasible for small, dedicated applica- 
tions. Additionally, the added bus 
control permits multiple-processor 
systems and sophisticated direct- 
memory-access schemes to be used 
with this processor. 

Perhaps the most impressive fea- 
ture of the CMOS version is complete 
compatibility with the 6502 specifica- 
tions, permitting the enormous base 
of 6502-based hardware and software 
to be used with the newer processor. 
The R65C02 processor represents a 
step above the 6502 similar to the step 
from the 6800 to the 6502, without 
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an external TTL (transistor-transistor v^ the accompanying compatibility 
logic) level single-phase clock input '^ problems. The current popularity of 
(as does the 6502) or an external RC'* 6502-based personal computers pro- 
network or crystal. It also has a quad- -^ vides a large market for new applica- 
rature clock output, which is not pro- ^ tions of this processor. ■ 
vided by the 6502. This clock goes^ 
high in the middle of the phase-1,^ 
clock and returns low in the middle ^ 
of the phase-2 clock. <j 

The 6502 has not been commonly :» 
used in applications requiring multi- 
ple processors or direct-memory ac- -^ 
cess, largely because it cannot float its .^ 
address bus. Both the R65C102 and ■ 
another version, the R65C112, have -' 
signals to permit bus sharing. The 
bus-enable (BE) signal permits an ex- 
ternal device to cause the processor 
to float the address and data buses 
and the R/W signal, permitting 
access to the system buses. To pre- 
vent bus arbitration from interfering 
with read-modify-write instructions 
such as shifts and increments, a 
memory-lock (ML) output signal is 
provided to notify external devices 
that the processor cannot relinquish 
the bus until completion of the 
instruction. The R65C112 is designed 
to be used as a slave processor, re- 



An Assembler for the R65C02 

HEXASM is a full-feature resident 
assembler that supports the R65C02 mi- 
croprocessor's extended instruction set. Jn 
addition to including such features as 
macros, conditional-assembly, and source- 
file-chaining functions, it can optionally 
be configured to either accept or reject con- 
structs that are unique to the R65C02. 
HEXASM runs under HEXDOS on Ohio 
Scientific's OSI CIP and is available for 
$38.50 from Hx Computer Products, Route 
8, Box 81E, New Bmunfels, TX 78130 or 
The 6502 Program Exchange, 2920 West 
Moana, Reno, NV 89509. 
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